HBase, MapReduce, and Integrated Data Visualization for Processing Clinical Signal Data

نویسندگان

  • Andrew V. Nguyen
  • Rob Wynden
  • Yao Sun
چکیده

Processing high-density clinical signal data (data from biomedical sensors deployed in the clinical environment) is resource intensive and time consuming. We propose a novel approach to storing and processing clinical signal data based on the Apache HBase distributed column-store and the MapReduce programming paradigm with an integrated webbased data visualization layer. An integrated solution negates the need to marshal data into and out of the storage system while also easily parallelizing the computation, a problem that is becoming more and more important due to increasing numbers of sensors and resulting data. We estimate upwards of 50TB of clinical signal data for a 200-bed medical center within the next 5 years. Consequently, efficient processing of clinical signal data is a vital step towards multivariate analysis of the signal data in order to develop better ways of describing a patient’s clinical status.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Distributed Data over HBase in Big Data Analytics Platform for Clinical Services

Big data analytics (BDA) is important to reduce healthcare costs. However, there are many challenges of data aggregation, maintenance, integration, translation, analysis, and security/privacy. The study objective to establish an interactive BDA platform with simulated patient data using open-source software technologies was achieved by construction of a platform framework with Hadoop Distribute...

متن کامل

Hive, Pig & Hbase Performance Evaluation for Data Processing Applications

Information extraction has received significant attention due to the rapid growth of unstructured data. Researcher needs a low-cost, scalable, easy-to-use and fault tolerance platform for large volume data processing eagerly. It is very important to evaluate the MapReduce based frameworks for data processing applications. This paper leverages the comparative study of HBase, Hive and Pig.The pro...

متن کامل

Distributed RDF Triple Store Using HBase and Hive

The growth of web data has presented new challenges regarding the ability to effectively query RDF data. Traditional relational database systems efficiently scale and query distributed data. With the development of Hadoop its implementation of the MapReduce Framework along with HBase, a NoSQL data store, the semantics of processing and querying data has changed. Given the existing structure of ...

متن کامل

Distributed Storage and Processing Method for Big Data Sensing Information of Machine Operation Condition

The traditional relational database cannot satisfy the requirements of the high speed and real-time storage and processing for the distributed Big Data sensing information in the Wide Area Network environment. In this context, the No-SQL database HBase is used to store the big data sensing information of machine operation condition collected by Fiber Bragg Grating sensor network. The distribute...

متن کامل

Scalable Inverted Indexing on NoSQL Table Storage

The development of data intensive problems in recent years has brought new requirements and challenges to storage and computing infrastructures. Researchers are not only doing batch loading and processing of large scale of data, but also demanding the capabilities of incremental updates and interactive analysis. Therefore, extending existing storage systems to handle these new requirements beco...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011